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@ Methods end apparatus for efficient allocation of resources by optimizing nonlinear, convex functions with linear 
constraints. 

fr) A method and apparatus is described for optimally 
locating resources. The optimal allocation is done by 
minimizing a cost (which is a. convex non-finaar function of 
various allocation variables) subject to different constraints 
(which are tin ear functions of the allocation variables). The 
method Initially picks a state of the above variables (xo) in the 
interior of the solution polytope (where the constraints are 
satisfied) and computes successive states xi,X2,- which 
progressively reduces the cost of allocation. The above 
Iteration stops when suitable stopping rules are met 

The method employs (i) an affine scaling transformation (a 
variant of Karrnarkar's projective transformation) of the linear 
constraints, (if) an ellipsoid to sphere transformation of the 
curved cost surfaces, (Si) a potential search scheme on the 
curved constant cost surfaces, frv) an affine scale adjustment 
mechanism, and (v) a line-search scheme. 
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Descriptl n 



METHODS AND APPARATUS FOR EFFICIENT ALLOCATION OF RESOURCES BY OPTIMIZING 
NON-LINEAR, CONVEX FUNCTIONS WITH LINEAR CONSTRAINTS 



5 Background of the Invention 

This Invention relates to systems for optimally allocating resources among a plurality of resource utilizers. 
More particularly, this Invention relates to methods and apparatus which perform improved allocation of 
resources in a physical system whose allocations costs follow a non-linear function and whose allocated 
parameters are linearly constrained. 
10 The need for resource allocation and optimization of systems arises In a broad range of industrial and 
technological areas. Examples of such are assignment of transmission facilities in telephone transmission 
systems, control of product mix in a factory, allocation of people to do a specified set of jobs such as operator 
services provided by telephone companies, Inventory control, deployment of industrial equipment, and many 
others. 

15 Resource allocation and optimization decisions are typically subject to certain constraints on various 
decision variables. Resources, for example, are always limited in availability, and sometimes their usefulness in 
a particular application also may be limited. Thus, the problem is to allocate the resources such that the 
constraints are satisfied while at the same time the cost of allocation is minimized, or the benefits arising from 
the allocation are maximized. The cost to be minimized can be any measure selected by the user, such as 

20 minimizing expenditures, minimize delay, minimizing inefficiency, etc. 

A general method of characterizing and solving such optimization problems Is via mathematical 
programming models. That is, the various resources of the system to be optimized are represented by a 
variable (one variable for each resource assignment), the collection of variables is viewed as a vector, a 
solution for the desired state of the system to be optimized is derived by evaluating a value for the vector 

25 through some selected procedure, and, lastly, the values of the components of the evaluated vector are said to 
represent the desired values of the allocated resources. 

There are many physical optimization problems that, when modeled as described above, consist of linear 
relationships and thus belong to a class of problems and their solutions that are known as Linear Programming 
(LP). However, there Is also quite a large number of other physical optimization problems that, when modeled 

30 as described above, comprise some linear relationships and some non-liner relationships. These are not LP 
problems. For example, the total amount of cargo carried by an air carrier cannot exceed the capacity of the 
planes which the carrier owns (at one cost) and which the earner can lease (at different costs) . This situation is 
typically modeled mathematically by a set of inequality constraints, and in such a situation, the cost function to 
be minimized may very well be non-linear. For example, the cost of leasing planes will go down (per plane) as 

35 the carrier increases the number of leased planes, until the supply of leasable planes diminishes to the point 
where per plane cost of leasing will begin to rise again. Such a relationship is called convex. 

In the description that follows, physical systems where the resource allocation effort results in a 
mathematical model where the operating parameters are linearly constrained and the cost function to be 
minimized is non-linear, are referred to as "Non Linear Functions - Linear Constrains* (NLFLC) systems. Of 

40 particular importance is a subset of NLFLC systems which is characterized by a cost function that is quadratic. 
A quadratic cost function is a function that Is convex, with the additional property that a line connecting any 
two points on the function do not cross any other point on the function. This is illustrated in FIG. 1 . where curve 
10 is shown as a function of the two variables x and y. Any two points on curve 10, such as points 17 and 18, 
can be connected with a line 19, and line 19 does not intersect curve 10. Curve 10, then, Is quadratic. Curve 40, 

45 on the other hand, is not quadratic because points 39 and 38 are connected by line 46 and line 46 intersects 
curve 40 at two other points. However, line 47 which connects point 39 with point 37 does not intersect curve 
40 at any other pomt. It can be said, therefore, that curve 40 has quadratic regions. 

Resource allocation tasks that can be modeled as NLFLC problems can be visualized in a manner identical 
to that of conventional linear programming (LP) tasks as described, for example, in the seminal invention of N. 

SO Karmarkar, U.S. Patent 4,744,028, issued May 10. 1988. To wit, each parameter which can be altered in the 
process of allocating resources is viewed as a dimension in a multi-dimensional orthogonal space, and the set 
of feasible solutions is found within the volume defined by the linear constraints of the system, each of which 
describes a plane in the multi-dimensional space. The collection of the intersecting multi-dimensional planes 
forms that volume. The multi-dimensional surface formed by the intersecting multi-dimensional planes is often 

55 called a "polytope", and the sides are often called "walls*. It has been known that. In general, the optimal 
solution to a NLFLC problem lies on a wall of the pdytope. This is in contrast with the LP case (where the cost 
function is also a linear function of the parameters) where the optimal solution Oes on a "vertex* of the 
potytope. 

A good summary of prior art for solving NLFLC problems can be found in the book -Practical Optimization' 
60 by P. E. GUI, W. Murray, and M. H. Wright, published by Acad mic Press in 1981. However, these methods 
simply cannot solve very larg problems, and the probl ms that these methods are capable of solving take a 
relatively long time to reach a solution. In short none of these methods are suitable for solving even 
moderately-sized resource allocation probl ms in real-time (that Is. sufficiently fast) to provide more or less 
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continuous control of an ongoing process, system, or apparatus. The deficiency stems from the fact that these 
methods consume a lot of computer time before th y come up with an optima! allocation strategy. 

To overcome the computational difficulties associated with the known methods to solve linear programs. N. 
K. Karmarkar invented the new method disclosed in the aforementioned U.S. Patent No. 4,744,028. According 
to this method, a starting feasible point is selected from the Interior of the polytope. and a series of moves are 5 
made in th direction that, locally, points in the direction of greatest change toward the optlmai vertex of the 
polytope. This direction, which is the direction of steepest 'descent" does not depend on the parameters 
(variables) to be optimized. A step of computable size Is then taken in that direction, and the process is 
repeated until a point is reached which is close enough to the desired optimal vertex. One specific aspect of 
Karmarkar's algorithm is that the above mentioned procedures are conducted in a transformed space and the w 
outcome is mapped back into the original space at each Iteration. It is important to remember that the 
transformation performed by Karmarkar (called the 'projective transformation "> is with respect to the 
constraints of the problem. No special transformation is done (as it is not necessary) with respect to the cost 
function. Under projective transformation, the linearity of the cost function Is preserved. 

Alas, this method is not suitable for optimizing general NLFLC systems because the nonlinear cost function 15 
results in a 'direction of descent" (gradient) that is a function of the parameters (variables) to be optimized. 
Neither projective transformation nor its variants (e.g., affine scaling) rectify this situation. 

Summary of the invention 

Our invention obtains a solution for the resource allocation task in a manner that, in part, is similar to that 20 
described by Vanderbei in U.S. Patent 4,744,026, issued May 10, 1988. Specifically, the resource allocation task 
is solved iteratively, and at each iteration a transformation is performed prior to affine scaling. The 
transformation rectifies the aforementioned problems by transforming the modeled system to convert the 
convex cost function (having ellipsoidal constant cost surfaces) to spherical form (having spheroidal constant 
cost surfaces). Once transformed, affine scaDng is applied and the iteration is continued. 25 

Because of this transformation, the search for an optimal point in the polytope becomes very efficient due to 
special geometrical properties of spheres. Without this transformation, the search would be very inefficient, or 
in some cases it may even fail completely. 

Similar to what one may experience in the LP case, sometimes it is possible to get "attracted" near a wall of 
the polytope, which may not contain the optimal solution. This potential difficulty is overcome by "pushing" the 30 
"attracted" solutions away from the wrong wall, along the curved cost surface, to a better position. This 
procedure speeds up the solution procedure considerably. 

As indicated above, the QP problem (quadratic problem) Is a special case of the general NLFLC problem. 
Whereas in the QP problem the cost function changes in a relatively slow and predicatable way, in the general 
NLFLC problem the characteristic of the cost surface can change dramatically from point to point. To obtain an 35 
improved cost allocation in the presence of such a varying cost function, a line search operation is performed 
to track the local optimal point from step to step to insure that a step Is not taken that passes beyond the 
minimum cost at the chosen direction. 

Brief Description of the Drawing 40 

FIG. 1 depicts the convexity property of a function; 

FIG. 2 is a graphical representation of the special case of optimal resource allocation for a QP problem; 

FIG. 3 is a graphical representation of the general case of optimal resource allocation for a QP problem 
along with the iteration steps of the new algorithm ; 45 

FIG. 4 is a depiction of the "zlg-zagglng" phenomenon of search directions due to ellipsoidal cost 
surfaces; 

FIG. 5 is an Illustration of the computational advantage due to spherical cost surfaces; 
FIG. 6 is an illustration If the affine scaling concept; 

RG. 7 is an illustration of the potential push on curved cost surfaces for a QP case ; SO 
FIG. 8 is a graphical representation of the concepts of the "projected normal" vector and the "projected 

push* vector used for the potential push; 
FIG. 9 is an Dtustration of the modification incorporated to the push used for QP, to cater the general 

NLFLC case; 

FIG. 10 is a general flow chart which shows the procedure which separates the QP from general NLFLC 55 
problems and chooses the appropriate process to solve them; 

RG. 11 is a flow chart which describes the solution procedure (process A) of a QP problem using the 
new algorithm; 

RG. 12 is a flow chart which describes the solution procedure (process B) of a general NLFLC problem 
(otherthan QP problem) using the new algorithm; and 60 

RG. 13 Is a block diagram of a resource allocation/control system using the ideas described FIGS. 1 
through 7. 

65 
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Detailed Description 
The Task 

The following description addresses the OP problem first becaus it is simpler to understand and because it 

5 is an important subset of the NLFLC set. Thereafter, the results are extended to the g nerai class of NLFLC 
problems, and some modifications are incorporated. Also, for the sake of simplicity and taking into account of 
the limitations of human perception, all of the graphical illustrations are confined to two dimensions only. 
However, the number of dimensions (number of decision variables) In practical problems can be hundreds, 
thousands, or even millions. Moreover, without iosing generality, only cost minimization is discussed, but It Is 

10 understood that benefit maximization can be treated in a similar fashion. 

FIG. 2 presents a two dimensional polytope 11 which encompasses the set of ail operational states of the 
system. Every point within the polytope corresponds to a feasible state of allocation. Thus, point 14 Inside 
polytope 11 is a feasible point. In FIG. 2, the elliptical contours 12 and 13 are constant cost surfaces. That is, 
(e.g., state 15) the cost of allocation associated with all points on surface 13 is identical. 

15 Furthermore, the cost is smaller than the cost of allocation of all points on surface 12 (e.g.. state 14). The 
challenge in connection with the FIG. 2 system is to arrive at a state which results in the least allocation cost 
(corresponding to the inner most ellipse) and at the same time maintain feasibility (stay inside the polytope). 
This arrive-at state is the optimal solution to the allocation task, which in FIG. 2 is point 16. It may be noted that 
point 16 is the optimal allocation with no constraints considered. This is a very special case of the QP problem 

20 and such occurrence is extremely rare In real-life physical resource allocation problems. 

A more realistic QP case is illustrated by FIG. 3. The unconstrained optimum 21 is not feasible since tt is 
outside polytope 24. Therefore, the optimal state for the FIG. 3 system is point 22, which lies on wall 23 of 
polytope 24. The optimal cost is the cost associated with the ellipse 25. 
Our invention starts with an initial feasible point 26, and proceeds radially, but always staying in the interior of 

25 polytope 24 (thus maintaining the feasibility), in steps 29, 30, etc., to states 27, 28, etc., each closer to the 
optimum feasible state 22. 

The QP Situation 

In standard vector notations, a typical QP problem can be expressed as follows: 
30 minimize 

J«V 2 x T Qx-c T x (1) 

subject to a set of linear constraints 

Ax - b (2) 

and a posmvity constraint x ^ 0 on x where 
35 x is an n component vector called the "solution vector" 

Q is an n x n matrix (at least positive semi-definite) called the 'cost matrix" 
c is an n component vector called "cost vector" 
A Is an m x n matrix called the "constraint matrix* 
and 

40 b is an m component vector which depicts the constraint limits. 

The above representation of the QP problem is commonly referred to as the "standard form". See Gill et a!., 
referenced infra. All objective functions and all constraint relationships pertaining to QP proglems can be 
reduced to this form by simple algebraic manipulations, with the help of what are known. as "slack" and 
"surplus" variables. These techniques are well-known in the prior art, as described, for example, in the 
45 aforementioned 4,744,026 Vanderbei patent. 

For the special QP case depicted in FIG. 2 (where the absolute minimum is within the polytope) the optimal 
solution and the corresponding cost can be easily calculated as, 

Xoptbmnn = Q~ 1 C (3) 

and it is straightforward to compute 

50 



^ optimum ^ * C*^) 

55 

For the general case of QP problems (where the unconstraint optimum lies outside the polytope) the 
solution can proceed by following the gradient of the cost very much as it is done by Karmarkar. However, this 
procedure may be inefficient for the reasons explained below. 
With reference to FIG. 4, fines 32, and 33 are the gradients (hence the search directions) which are 

60 orthogonal to constant cost surfaces 35, and 36. Proceeding in accordance with gradient concept alone, the 
solution would start from one state, say 30, and proceed along th gradient (32) to another state, say 31 . State 
31 is selected based on the criterion that is related to the change in the gradi nt; and more specifically, based 
on a determination that the current direction (32) is tangential to another constant cost surface, such as 
surface 36. When that condition is detected, a new gradient Is computed and the solution proceeds from state 

65 31 to the next state, such as th state represented by point 48. The aim is to reach the optimal state 34 in this 
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manner. But, as can be observed, the procedure "zig-zags" considerably before reaching this point. 

Our invention overcomes this difficulty by transforming the elliptical (or ellipsoidaJ-in multi-dimensions) 
cost contours to circular (or spheroidal- in multi-dimensions) contours. Thus, with reference to FIG. 5. 
because of the special geometrical properties of sph roidaJ surfaces, the search direction 41 is always 
orthogonal to contours 42, 43. 44 etc., and this search dir ction is oriented closely toward the optimal state, 5 
45. 

The mathematical translation of the above idea is the transformation of the variable x (solution vector) to 
jf = L T x (5) 

where L is a vector that is related to the cost matrix Q by 

Q ~ LL T . (6) to 
Note that x > 0, but x* is unconstrained. 

Following this transformation, the entire problem is scaled (afflne scaling) by mapping the current state 
(point) to a common distance u. from the constraint wails, where u. is a positive scalar. This process can be 
mathematically represented by the product D~ 1 x, where D is a diagonal matrix with diagonal elements set to 
(1/u.)xi. Referring to FIG. 6 where the unsealed and scaled porytopes are shown, the current state 55 in the 15 
original (unsealed) polytope 51 is mapped to state 56 in the scaled polytope 52, forcing it to be at a distance p. 
from walls 54 in the positive quadrant. 

With some mathematical manipulations, it can be shown that the negative of the direction of movement 
(step direction) from a state x° to minimize the cost of allocation is expressed by. 

Sxp « -P - HA T (AHA T )" 1 A] H(Qx° - C) (7) . 20 

where 

H equals (XQ + 0*0-2)-*. 

0*0 is a diagonal matrix with the diagonal entries the components of x°, and 
X Is a positive 'scale* parameter, which advantageously is set to ur 2 . 

The next task Is to calculate the step length which determines the size of the step (movement) away from x° 25 
in the direction Sxp. 

One constraint, of course, is the presence of the walls of the polytope. The step length due to this constraint 
Is limited by 



0.97 

a x = (8) 



[6ip T Q5xp] 



30 



which goes takes up to 97<Vb percent of the distance to the nearest constraint wall. Another constraint, 35 
however, relates to the curvature of the cost surface. As shown in FIG. 4. while proceeding in the direction 5xp, 
a point is reached where the cost stops to decrease and begins to increase. Going beyond this point is clearly 
not advisable, and therefore, another value is computed that evaluates the step length to the minimum cost 
along the chosen direction. This step length is 



40 



45 



The actual step length is. of course, the smaller of the two, i.e.. 

a « mln(ai,a2) . . 

The new state x 1 Is obtained by evaluating, 
X 1 ~ x°-a6xp. (10) 

The expression for the descent direction, Sxp, is called the 'null space' formulation, because it is obtained by SO 
the projection of the gradient (Qx° - c) on to the null space of A. An alternate form of this would be the 'range 
space" formulation. The only difference between the two formulations is the way in which the projection matrix 
Is constructed. It can be shown that the direction for the range space formulation of the QP algorithm is given 
by 

6xp - rfzxa + irMZ^zcox 0 -c), (11) ss 
where Z is called the "orthogonal compliment" matrix of A and satisfies the feasibility condition ATv ■= 0 for 
any vector v. 

To construct Z, the matrix A may be represented by [ B I N ] where B and N are of dimensions m x m and 
mx(n-m) respectively, and then 
Z « [(-B-*N) T | Q (12) 

where I represents the "identity matrix" of dimension m. The scale parameter X can be adjusted to make the 
conv rgertce of the algorithm faster. (It can be shown that for the LP case, the scale is totally absorbed in the 
step length ai. and consequently scale adjustments are Ineffective). The way to do this is to perform a "search 
on X, but it should be noted that every time th value of X Is changed, a matrix inversion must be performed. 
Therefore, the following recursion is useful In adjusting X in the neighborhood of some X° without performing €5 
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the inversion each and every time. 

5x P = 'l^ 0 -^ o g (B) (13) 

5 n=0 
where 

gW „ pOQgO»-1> t 

W P° is the projection matrix, [I - HA^AHA*)' 1 A) H, evaluated at X°, and 

g* is the gradient of the cost function evaluated at x° which in this case is Qx°-c. 
Also, sometimes it is possible that the successive solutions of the algorithm to be near a constraint wall. 

Under that situation, the convergence of the algorithm can be slow. When this happens, a potential search 

method can be used to recent er the solution on the current cost surface. With reference to FIG. 7. 61 is the 
15 polytope and 62 is the state from which a move is made in the direction 63 to reach the state 64. which is near 

wail 65. The optimum is state 66. What is necessary to do is to slide 64 along the constant cost trajectory 67, to 

recenter at a better position, such as position 68. This Is what we call the potential search. 
The essential idea behind the potential search is to construct a "push vector' which pushes the solution 

away from the walls. Of course, the path traced by this push vector should be of constant cost, while 
20 maintaining feasibility. 

The push Is accomplished by evaluating rip which is the projected normal at Xo, and by evaluating v p which is 

the projected push direction at xo. where xo is the point that is to be pushed away from the walls. Letting xt be 

the path traced by xo so that the cost is unchanged, we compute 

n p = Z^ZOZ^Zgo (14) 
25 and 

v P - C rrZHZV'Zvo (15) 

where 

(i) Z is such that AZW - 0 f or any vector v. 

(ii) go is the gradient to the curved level set at x°, 
30 (iii) 



= f 8o T z T (ZQ2 T r 1 z go I t 

~ L v o T Z T (ZHZ T r 1 ZQZ T (ZHZ T r 1 Zv 0 J ' 



(lv) H = (XO +- D" 2 ),and 
(v) v 0 is the "Karmarkar Push Vector" obtained as 
40 voi = xoi ~ 1 -Pflej fori - 1.2, * - n 

where 8 = — — — and 
So'gp 

45 

vr- p ~ crCZHZyZxo- 1 .9* - CZ T (ZHZVZgo. 
Letting a be IIQIL where II g denotes the spectral norm, a •search" along the trajectory 
x« - xo + (cos at-1)n p + (sin a t)v p (16) 
can be performed so that the 'potential function" 

50 

-Xlog(^) 

55 is minimized. Referring to FIG. 8 as an example. 71 depicts the spherical surface. 72 and 73 are the projected 
normal and the projected push vectors at state 75 vectors, and curve 74 represents the trajectory of constant 
cost 

Generalizing to NLFLC 

60 Our QP algorithm can be generalized to other NLFLC optimization problems in systems where the cost 
function is other than quadratic but stffl convex. The task is then represented by 
mln t(x) (17) 

subject to a set of linear constraints 
Ax - b and x>0. (18) 

65 Letting g(x) be the gradient of f(x) at x. (i.e.. V(x) = g(x)). and G(x) be the Hessian matrix ot f at x (Le., 
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V 2 f(x) = G(x)), and replacing Q by G(x) and Qx - c with g(x) in our QP algorithm, yi Ids a new algorithm to 
solve the NLFLC problem. Unlike the QP algorithm, however, the Q term depends on the vector x, and this 
Invalidates our analytical xpresslon (9) for the step size aa. The step size 02 in this case is then determined by 
a fast "line search method", where th step length of desc nt is determined by successive evaluations of f(x) 
in th descent direction. 5 

Incorporating the required line search, our new algorithm for solving the NLFLC problem can be stated as 
follows: 

Choose two positive constants (J and y such that 0<(J<1andO<Y<1. *lo n 9 with a starting feasible 
solution x°. Then, compute the search direction 6xp from equation (7) or (1 1 ) by substituting G(x°) and g(x°) In 
place of Q and Qx° - c respectively. Use 02 In (9) as an estimate for the potential step size, where Q is replaced 10 
by G(x°). Lastly, obtain the true step length a such that 
f(x«-a6xp) S l(x 0 >-Yafl T (x°)6xp (19) 
where 

a - min(a 2 p h .ai) (20) 

the term ai Is given by equation (8) and h is the first positive Integer (including zero) that satisfies equation 15 
(19). It can be shown that the line search algorithm converges (reaches the optimal answer) under reasonably 
weak assumptions. 

The rest of the procedure is the same as the QP algorithm, except for the following difference in the 
potential search. While performing the potential search, we recenter along a path based on a quadratic 
approximation of f(x). This recentering scheme maintains feasibility but does not keep f(x) constant. 20 

Referring to FIG. 9 for example, state 81 is translated in direction 82 to state 83. This solution is then pushed 
along the constant cost surface 85 based on the quadratic approximation of f(x). The actual constant cost 
surface is 84. and we stop the push along the trajectory of the approximation of f{x) when the difference 
between trajectories 84 and 85 (which in this case is distance 86) exceeds a predefined value, say 6J. 
Thereupon, we start a new iteration. 25 

Finally, a stopping criterion for the new algorithm (for QP as well as NLFLC) could be 
*f* Sxp £ e (21) 

where e is some fixed small positive number and 
ll-A T (AHAY 1 AH] [OX°-c] £ 0 (22) 

30 

The Process 

The process of improving the operational state of a commercial enterprise such as improving the state of 
resource allocations of the enterprise Involves first Identifying the resources that are available, determining the 
resource allocation needs and constraints, and selecting a cost function to be minimized that makes sense for 
the particular circumstance of the enterprise. All of this information is then represented by symbols (variables) 35 
for easy manipulation, and the task is structured In terms of the represented variables as a minimization task. 
When so structured and when it is found that the constraint relationships are linear but the cost function is 
convex, then the principles of our invention can be applied to an advantage. 

In such a circumstance, according to FIG. 10 which outlines our method in broad terms, the task is 
formulated as a QP or general NLFLC task in block 91 , and in block 92 an initial feasible state of the system is 40 
selected (x°). and a scale parameter X is chosen (usually. 1.0). This is called the "phase 1" problem. As the 
constraints are linear, one can use the same phase 1 operation of LP case. This process is disclosed in the 
aforementioned U.S. Patent No. 4,744,026 issued to R. J. Vanderbei. Of course, in existing systems the task of 
chosing an initial state Is trivial since the existing state of the enterprise can be selected as the initial state. 
Continuing with FIG. 10, block 93 represents a process to detect whether the optimization problem is QP or 45 
general NLFLC. H it is QP, process A (block 94) is invoked; otherwise process B (block 95) is Invoked. Thus, 
block 96 represents the entire iterative scheme (though In very general terms) to solve all NLFLC problems. 
Lastly, step 97 makes the assignment, as determined by block 96. to place the enterprise in the neighborhood 
of the optimum state of the system. Processes A and B are described in more detail In FIGS. 11 and 12. 

Referring to FIG. 1 1 , block 101 is the detection step for determining whether the number of variables (n) is 50 
more than the number of constraints (m). if (m > n). then we proceed to block 103 to use the null space 
formulation to obtain the descent direction. Otherwise, we go to block 102 to use the range space formulation. 
Next, the two step lengths (ai and 012) are computed In block 104, and block 105 performs the translation from 
x° to x 1 In the direction of descent using the step length computed above. After that, block 106 determines 
whether a scale adjustment of the initially chosen scale value should be done, and if advisable, the process 35 
continues to block 107 where the adjustment is made, a new direction Is evaluated, and control is returned to 
block 105. When it is determined that no scale adjustment is necessary, control transfers to block 108 where, if 
necessary (that is, if a = ai), a potential push is performed In block 108 to recenter the solution, as described 
before. Finally, block 109 checks the convergence condition, such as the one expressed by equations (21 ) and 
(22). When the condition is met, process A terminates; otherwise control returns to block 101. The 60 
above-described steps are repeated until the convergence conditions are met 

Process B is very similar to process A and it is illustrated in FIG. 12. Block 110 performs a quadratic 
approximation on f(x). Blocks 111. 112, and 113 are the same as blocks 101. 103, and 102 or process A. In 
block 114. we compute th si p I ngth as defined by equation (19) and use it in conjunction with th search 
direction that Is obtained from- block 112 or 113. This op ration is done in block 115, which essentially uses 65 
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equation (20). Blocks 1 16 and 1 17 are identical to blocks 108 and 107 of process A. In block 1 18, we perform 
the approxJmat potential push which was described b f re in conn ction with FIG. 9. essentially using 
equations (14) through (16). Block 119 is identical to block 109 and the whole process B Is repeated until these 
conditions are satisfied. 

5 

The Apparatus 

FIG. 13 illustrates a process control system in accordance with the principles of our invention, It may be a 
telephone communication system, a manufacturing process, a navigation process, a financial system (e.g., 
portfolio management system), a lease of planes (as described before) arrangement, or any other industrial. 
10 technological or economic process which is to be optimized and which exhibits the "convexity" property 
discussed above. 

A cost function evaluation device 170 receives the cost data (information on how allocation cost varies) on 
bus 171 from a computer terminal or some other separate process. Device 170 aims to accommodate the fact 
that costs are not necessarily fixed and that different enterprises may choose different cost arrangements that 

15 they may wish to minimize. Device 170 fits the incoming information to a cost function representation, which 
for the systems under consideration is in general non-linear and more particularly, convex. The information is 
passed by device 170 to cost evaiuator 172 which evaluates the value of the fitted cost function at any state of 
the system. That is, given an operational state of the system represented, device 172 evaluate the cost in 
accordance with the given cost function. To evaluate the cost, device 172 must be provided with a description 

20 of the state of the system, and that information is provided by controller 185 via lead 173. The cost 
determination is passed to controller 180 via lead 174. 

Similarly, a limit register 183 is provided to store the physical limits on the allocation variables. These limits 
can be entered into the system via lead 184, like the cost data. 
The system of FIG 13 also includes sensors 186, 187, • • - , 188 which dynamically sense the constraint 

25 coefficients of the enterprise. In the airline industry, as an example, the number of planes at different locations 
win, of course, change with time, and even the number of operating airports will fluctuate. It is these 
"environmental" changes that sensors 186-188 monitor. Each constraint sensor 188-188 has a corresponding 
change detector, 189, 190, - - - , 191. The detectors detect the changes in the output of each of the respective 
sensors. A change indication signal from each of these detectors is applied to change bus 192 and thence to 

30 AND gate 193. Also applied to gate 193, on lead 194, is a signal from controller 185 Indicating the termination of 
the optimization procedure. Through a separate output port at each of the change detectors, the outputs from 
sensors 186-188 are applied to controller 185. 

Controller 185 of FIG. 13 Is, in the preferred embodiment, a programmed digital computer having stored 
therein the program which implements the flowcharts of FIGS. 10, 11 and 12, making use of the ideas 

35 presented in FIGS. 1 through 9. Controller 1 85 may also comprise a complex of hardwired circuits, designed to 
carry out the operations described by the flowcharts of FIGS. 10 through 12, or to realize the ideas of FIGS. 1 
through 8. 

Since NLFLC controller 185 of FIG. 12 utilizes the extremely fast procedures illustrated in RGS. 3, 4, or 5, 
control values are available for registers 195-197 in a very short time. Moreover, as the constraints change, 
40 these changes are sensed by sensors 186-188, detected by detectors 189-191, and used partially to enable 
AND gate, 193. When the procedures of FIGS. 10-12 are complete, controller 185 generates process control 
signals and transfers the signals to registers 195-197. Simultaneously, controller 125 generates an enabling 
signal on lead 194 to AND gate 193, completing the enablement of AND gate 193. The entire process is then 
repeated. 

45 A typical type of problem in modem control theory area to which the present invention can be applied Is 
stochastic control of systems, for example the control of robots. A description of these type of problems can 
be found in the book "Computer Controlled Systems' by K. J. Astrom and B. Wittenmark, published by 
Prentice Hall in 1986. 

Other problems which would benefit from our invention are financial portfolio management, optimal loading 
so problems of power utility companies, and optimal staff allocation problems. 

It should be noted that the matrices involved in most practical NLFLC cases are sparse in nature and sparse 
matrix techniques can be used in the operations depicted In FIGS. 10 through 12. 



55 Claims 

1 . A method for allocating industrial facilities in an enterprise so as to reduce the cost associated with a 
selected allocation of said facilities in said enterprise, where said allocation of said facilities Is linearly 
constrained by a set of constraints and said cost Is expressed in terms of a given convex function, said 

60 method comprising facilities allocation iterations where each iteration receives a potential facilities 

allocation having an associated cost.and develops a potential facilities allocation for the next iteration as 
long as a selected stopping criterion Is not met. each iteration further comprising the steps of: 
performing a scaled spheroidal transformation that op rates on representation of said received potential 
facilities allocation to form a transformed repr s ntation of said potential facilities allocation such that 

65 . components of said transformed representation of said potential facilities allocation ar equal (affin 
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scaling), and th sum of squared components of transformed representations of other facilities 
allocations having the same associated cost Is equaJ to the sum of squared components of said potential 
facilities allocation (spheroidal transformation) ; 

developing a translation vector for use in creating said new potential facilities allocation for the next 
iteration; tj 5 

testing the components of said translation vector to develop a process-stop signal when said translation 
vector falls below a preselected threshold; 

developing said new potential facilities allocation for the next iteration when said translation vector fails to 
fall below said preselected threshold; and 

applying the last developed new potential facilities allocation to said enterprise when said translation 10 
vector fells below a preselected threshold. 

2. A method for allocating Industrial facilities In an enterprise so as to reduce the cost associated with a 
selected allocation of said facilities, where said allocation of said facilities is linearly constrained and said 
cost is expressed in terms of a given convex function, said method comprising facilities allocation 
Iterations where each iteration receives a potential facilities allocation, and develops a potential facilities is 
allocation for the next iteration as long as a selected stopping criterion is not met, each Iteration further 
comprising the steps of : 

transforming the representation of the potential facilities allocation, vector x°, to develop a spheroidal 
cost function for said enterprise such that the sum of squared components of the transformed facilities 
allocation representation (components of X*) is a constant value and the sum of squared components of 
representations of transformed other facilities allocations (components of x 1 ',** • • •) having the same 
cost as the cost of said potential transformed facilities allocation (x*) is equal to said constant value; 
scaling the representation of the potential facilities allocation, x°, to move the potential facilities allocation 
to a scaled potential facilities allocation represented by vector x°" and having identical scaled facility 
allocations (components of vector x°") ; 25 
developing a translation vector for use In creating said new potential facilities allocation for the next 
iteration; 

testing the components of said translation vector to develop a process^stop signal when said translation 
vector falls below a preselected threshold ; 

developing said new potential facilities allocation for the next iteration when said translation vector fails to 30 
fall below said preselected threshold; and 

applying the last developed new potential facilities allocation to said enterprise when said translation 
vector falls below a preselected threshold. 

3. The method of claim 1 where said step of performing a scaled spheroidal transformation comprises 

the steps of ^ 35 

transforming the representation of the potential facilities allocation to develop a spheroidal cost function 
such that the sum of squared components of the transformed facilities allocation representation is a 
constant value and the sum of squared components of representations of transformed other facilities 
allocations having the same cost as the cost of said potential transformed facilities allocation is equal to 
said constant value, followed by the step of 40 
scaling the representation of the potential facilities allocation, x°, to move the potential facilities allocation 
to a scaled potential facilities allocation and having identical scaled facility allocations. 

4. The method of claim 1 where said step of performing a scaled spheroidal transformation comprises 
the steps of 

scaling the representation of the potential facilities allocation. x» to move the potential facilities allocation 45 
to a scaled potential facilities allocation and having identical scaled facaity allocations, followed by the 
step of 

transforming the representation of the potential facilities allocation to develop a spheroidal cost function 
such that the sum of squared components of the transformed facilities allocation representation is a 
constant value and the sum of squared components of representations of transformed other facilities SO 
allocations having the same cost as the cost of said potential transformed facilities allocation is equal to, . 
said constant value. 

5. A resource allocation system for allocating facilities of an enterprise so as to reduce the cost of 
operating said enterprise with a selected allocation of said facilities, where said allocation of said facilities 

Is linearly constrained and said cost is expressed in terms of a given convex function, comprising: 35 
an N plurality of sensors, where N Is a positive Integer, for determining the existing state of allocations of 
said facilities; 

means for applying an allocation of said faciOtles; 

means responsive to said sensors for storing output signals of said sensors in an alterable memory, 
thereby forming a stored state of allocations of said facilities, said state having N components; 60 
transform means responsive to said means for storing for performing a scaled spheroidal transformation 
of said stored state of allocations to form a transformed allocation state signal such that components of 
said transformed allocation state signal are equal, and the sum of squared components of transformed 
repr sentations of other facflitales allocation states having the same associat d cost Is equal to the sum 
of squared components of said transformed allocation state signal ; 65 
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translation menas responsive to said transform means for developing a translation vector signal; 
comparison means responsive to said translation means for developing a process-stop signal when said 
translation vector signal falls below a preselected threshold; 

means responsive to said comparison means for developing an improved facilities allocation state and 
storing said Improved facilities allocation state In said alterable memory; and 

means responsive to said process-stop signal for applying most recent one of said improved facilities 
allocation states to said means for applying an allocation of said facilities. 

6. A system for optimizing the performance of controlled process in accordance with an optimization 
criterion, said system comprising: 

process control devices for controlling said process in response to a control signal set through control 
signals; 

a plurality of sensors for sensing variable conditions affecting the operation of said process via input 
signals; 

a non-linear programming controller interposed between said sensors and said process control devices 
for developing control signals applied to said process control devices, where the developing of said 
control signals is done in accordance with said optimization criterion and where said optimization 
criterion is a non-linear convex function of said input signals; 

said controller including means for iteratlvely identifying successive tentative strictly feasible control 
signal sets and selecting each tentative control signal set in the direction of the steepest gradient of a 
normalized version of the optimization criterion in a space where the constraints on the control signals are 
also normalized. 

7. A method for improving the operational state of a commercial enterprise characterized by a number 
of resources xi. (i = 1 ,n) among a plurality of physical resource users subject to constraints Aqxi = bj and 
xi £ 0 (I = 1 ,n;j = 1 t m) in such manner to optimize a cost function (1/2)X{QqXi - c»xi (i = 1 ,n;j 1 ,n), said 
method comprising the steps of: 

(a) selecting an initial allocation x° => (x5,xj, • • - xj) meeting the said constraints, 

(b) using the transformation x 1 - L T x where 

Q « LIT to make the constant cost surfaces spherical, 
_ (c) using the scaling y — DrV where 

D = diag(1/u.Kx?>4r • -x^) to place x° at a distance u. from the constraint walls, 

(d) computing the descent direction 5x p by using the relationship 
6xp « -[I - HA T (AHA T )" 1 A]H(Qx° - c) where 

H <=* (XQ + D*-*)- 1 ,!)** is the diagonal matrix containing the components of x°, 

X o ur 2 , and 

I is the identity matrix, or 

8xp «= - Z T (Z(XQ + t)-**o)Z*)-*Z{Cufi - C), 

where Z «= [(-B~ 1 N) T I], where matrix A »[BN] 

(e) computing the step length a — rnin (01,02), where 

„ 0.97 anH „ _ fSxpW - c)] 
ct\ = and Obx = — = , 

(f ) selecting the new allocation x 1 by x 1 — x° - a6xp, 

(g) adjusting X for maximum cost reduction, 

(h) recentering In the new constant cost surface if a « ai using the potential push described in 
the detailed description of the algorithm, 

(i) testing for the criterion "P* 8x« S e (where e is some fixed small positive number). 
(J) testing the criterion p - A T {AHAV AH] [QX° - c] > 0 

(k) stopping the iterative scheme If (i) and (j) are satisfied, or 

(I) going back to step (d). replacing x° by x 1 are repeating, 

(m) allocating said resources In accordance with x 1 if (i) is satisfied. 

8. A method for allocating industrial, financial, or technological resources Xj (i » 1,n) among a plurality 
of physical resource users subject to constraints Aqxj = bj and xi 2; 0 (i = 1 ,n.) = 1 ,m) in such manner to 
optimize a general convex non-linear cost function f(x), said method comprising the steps of: 

(a) making a quadratic approximation on f(x) and performing the steps of (a) through (e) of claim 
10. 

(b) calculating the step length a as per the lin search rule 
f(x° - oBxp) S f(x°) -Yag T (x°)6xpWh re 

a = min (a20 h .ai) where h is the first posffiv integ r (including zero) that satisfies the said line 
search rule, 

(c) obtaining the new allocation as per (0 of claim 10, 

(d) performing a scale adjusment as in (g) of claim 10, 
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( ) recentering on constant cost surface as of (h) of claim 10, with the modification that the 
translation should be stopped, also taking into account the difference between the actual functional 
value and its quadratic approximation, 

(g) allocating the resources like (m) of claim 10. 

9. Apparatus for allocating resources in an optimal manner among users of said resources comprising 5 
means for receiving information concerning said users, cone ming the availability of said resources, and 
concerning the constraints on the allocation of said resources, 

means, utilizing the algorithm of claim 7 or claim 8, for iteratively approximating the optimal allocation of 
said resources among said users. 

means for allocating said resources in accordance with the last option of allocating said resources w 
approximated by said iterative approximating means. 

10. The method of claim 1 where said step of developing said new potential facilities allocation for the 
next iteration comprises a line search along the direction of said translation vector. 

11. The method of claim 1 where said step of developing said new potential facilities allocation for the 

next iteration comprises a line search along the direction of said translation vector to identify a potential 15 
facilities allocation whose cost is not higher than the cost of a potential facilities allocation further along 
said translation vector direction. 

12. The method of claim 1 where said step of developing said new potential facilities allocation for the 
next iteration comprises a One search along the direction of said translation vector to identify a potential 
facilities allocation whose cost Is not higher than the cost of a potential facilities allocation further along 20 
said translation vector direction and which Is not closer than a preselected threshold to violating any one 

of said constraints. 
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